SIMD-0182: Consume requested CUs for sBPF failures #182

tao-stones · 2024-10-04T14:16:00Z

No description provided.

Co-authored-by: Philip Taffet <[email protected]>

proposals/simd-0182-conditional-cu-metering.md

buffalojoec · 2024-10-10T14:10:51Z

proposals/simd-0182-conditional-cu-metering.md

+If VM execution returns any error except `SyscallError`, transaction's CU meter
+should be fully depleted, in another words, all requested CUs are consumed;
+otherwise consumes the actual executed CUs.


Just out of curiosity, why make syscall errors exempt?

My understanding is syscall not happen in VM realm. @Lichtso @ptaffet-jump please correct me if I'm wrong.

Syscall errors are never thrown in the middle of basic blocks, so we can easily know exactly how many CUs were consumed when the error was thrown. This is why we can safely make them exempt.

In the usual definition of basic blocks, call instructions can occur anywhere in a basic block (apart from the last instruction). So I would say this is unclear.

Also, it does not explain the need for the exemption. I would expect all CUs to be all consumed if a program failed, including if that failure was in a syscall.

I would expect all CUs to be all consumed if a program failed, including if that failure was in a syscall.

Why would you expect that? If a program makes a CPI and that invoked program returns an error, we immediately stop executing the transaction and therefore have more time to execute other transactions in a block. If we consume all remaining CUs for that failure, we are wasting that capacity for execution.

@seanyoung: This is mostly about how easy / hard it is to pinpoint the precise fault location. During syscalls the VM has stopped anyway and the return address is known on the stack. Otherwise it can be any location since the last metering (which happens at controlflow instructions). Also syscall errors are far more common and expected to happen regularly, whereas other faults such as access violation only happen in broken programs.

@Lichtso that makes sense 👍

proposals/simd-0182-conditional-cu-metering.md

jstarry · 2024-11-20T21:28:19Z

proposals/simd-0182-conditional-cu-metering.md

+
+## Detailed Design
+
+If VM execution returns any error except `SyscallError`, transaction's CU meter


I see that with direct mapping enabled, we convert EbpfError::AccessViolation into the SyscallError variant here: https://github.com/anza-xyz/agave/blob/af0ed22174999cb62579a0621f6b274c85ebf267/programs/bpf_loader/src/lib.rs#L1501

I am assuming that this access violation should also deplete compute units and should not be considered as an exception right?

@Lichtso @alessandrod

Yes, same reasoning as: #182 (comment)

Should consume all cu's or not?

Yes, we should. This is not caused by a syscall so the exact instruction counter would have to be calculated otherwise.

Just want to call out that this type of access violation is probably going to be fairly common when direct mapping is enabled. So it's not exactly true to say that these types of errors are "rare, exceptional situations" or "A rare case" as we say in the SIMD. We should update the SIMD to say that these types of failures are "less common" rather than rare at least.

I see that with direct mapping enabled, we convert EbpfError::AccessViolation into the SyscallError variant here: https://github.com/anza-xyz/agave/blob/af0ed22174999cb62579a0621f6b274c85ebf267/programs/bpf_loader/src/lib.rs#L1501

I am assuming that this access violation should also deplete compute units and should not be considered as an exception right?

@tao-stones this should probably be made explicit in the SIMD. Maybe we can say that all EBPF errors from the vm are treated as irregular.

Mmm, imo, SIMD's text If VM execution returns any error except SyscallError, .... is clearer.

I don't think that a spec should be based on "arbitrary" rust types.
We should clarify what's the expected behavior, for example if there's a memory issue inside a syscall, is that a Syscall or Epbf error?

I agree that if we enter a syscall and that fails, we know the exact number of CUs consumed, and so there's no need to deplete them all. I'm not sure if checking the SyscallError type is the correct implementation though. (In the example above, imo a memory issue inside a syscall should NOT deplete, however I don't know if the error thrown is SyscallError or EbpfError)

Co-authored-by: Justin Starry <[email protected]>

proposals/simd-0182-conditional-cu-metering.md

Co-authored-by: Justin Starry <[email protected]>

… VM error as "less common"

topointon-jump

This is awesome

topointon-jump · 2024-11-22T04:34:34Z

proposals/simd-0182-conditional-cu-metering.md

+
+## Detailed Design
+
+If VM execution returns any error except `SyscallError`, transaction's CU meter


I see that with direct mapping enabled, we convert EbpfError::AccessViolation into the SyscallError variant here: https://github.com/anza-xyz/agave/blob/af0ed22174999cb62579a0621f6b274c85ebf267/programs/bpf_loader/src/lib.rs#L1501

I am assuming that this access violation should also deplete compute units and should not be considered as an exception right?

@tao-stones this should probably be made explicit in the SIMD. Maybe we can say that all EBPF errors from the vm are treated as irregular.

ravyu-jump

LGTM

t-nelson · 2024-12-11T17:21:28Z

should this be merged? there is already an implementation in agave master and backport PRs open

bw-solana · 2024-12-11T20:23:13Z

I support merging for Agave. CC @Benhawkins18

Benhawkins18

Approvals from both Jump and Anza. Merging

tao-stones and others added 9 commits September 26, 2024 14:57

draft

55e4924

Apply suggestions from code review

989a88f

Co-authored-by: Philip Taffet <[email protected]>

set textwidth

6f6e049

add irregular failure term

19a48f1

updated proposed design details

08a9f56

/typo

9a22dfc

updated detail section

d307391

clean up

30f088c

add simd id

d439937

tao-stones marked this pull request as ready for review October 4, 2024 17:31

github-actions bot mentioned this pull request Oct 7, 2024

Upstream Updates - Mon Oct 7 00:14:36 UTC 2024 smartcontractkit/chainlink-solana#880

Closed

Benhawkins18 changed the title ~~Conditional cu charging~~ SIMD-0182: Conditional cu charging Oct 8, 2024

buffalojoec reviewed Oct 10, 2024

View reviewed changes

jstarry reviewed Nov 20, 2024

View reviewed changes

proposals/simd-0182-conditional-cu-metering.md Outdated Show resolved Hide resolved

proposals/simd-0182-conditional-cu-metering.md Outdated Show resolved Hide resolved

proposals/simd-0182-conditional-cu-metering.md Outdated Show resolved Hide resolved

jstarry reviewed Nov 20, 2024

View reviewed changes

tao-stones and others added 3 commits November 20, 2024 16:01

Apply suggestions from code review

a69ec93

Co-authored-by: Justin Starry <[email protected]>

update

0aed128

fmt

0dfaa4d

jstarry reviewed Nov 20, 2024

View reviewed changes

proposals/simd-0182-conditional-cu-metering.md Outdated Show resolved Hide resolved

tao-stones and others added 2 commits November 21, 2024 09:15

Update proposals/simd-0182-conditional-cu-metering.md

7350797

Co-authored-by: Justin Starry <[email protected]>

lint

5194227

tao-stones changed the title ~~SIMD-0182: Conditional cu charging~~ SIMD-0182: Consume requested CUs for sBPF failures Nov 21, 2024

in anticipating furture changes such as direct mapping, describe such…

e8c74ac

… VM error as "less common"

topointon-jump approved these changes Nov 22, 2024

View reviewed changes

ravyu-jump approved these changes Nov 22, 2024

View reviewed changes

jstarry mentioned this pull request Nov 22, 2024

feat: deplete compute meter for vm errors anza-xyz/agave#3751

Merged

This was referenced Dec 5, 2024

v2.1: feat: deplete compute meter for vm errors (backport of #3751) anza-xyz/agave#3955

Open

v2.0: feat: deplete compute meter for vm errors (backport of #3751) anza-xyz/agave#3956

Open

This was referenced Dec 6, 2024

flamenco, vm: syscall err code refactor firedancer-io/firedancer#3655

Merged

flamenco, vm: (SIMD-0182) consume requested CUs for VM Interp failures firedancer-io/firedancer#3656

Merged

This was referenced Dec 7, 2024

Feature Gate: Consume all requested CUs for VM failures anza-xyz/agave#3993

Open

use new feature gate for cu depletion anza-xyz/agave#3994

Merged

This was referenced Dec 11, 2024

v2.0: use new feature gate for cu depletion (backport of #3994) anza-xyz/agave#4067

Open

v2.1: use new feature gate for cu depletion (backport of #3994) anza-xyz/agave#4068

Open

Benhawkins18 self-requested a review December 11, 2024 20:28

Benhawkins18 approved these changes Dec 11, 2024

View reviewed changes

Benhawkins18 merged commit 51e9a72 into solana-foundation:main Dec 11, 2024
2 checks passed

github-actions bot mentioned this pull request Dec 16, 2024

Upstream Updates - Mon Dec 16 00:16:10 UTC 2024 smartcontractkit/chainlink-solana#980

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SIMD-0182: Consume requested CUs for sBPF failures #182

SIMD-0182: Consume requested CUs for sBPF failures #182

tao-stones commented Oct 4, 2024

buffalojoec Oct 10, 2024

tao-stones Oct 10, 2024

topointon-jump Nov 3, 2024 •

edited

Loading

seanyoung Dec 3, 2024

jstarry Dec 3, 2024

Lichtso Dec 5, 2024

seanyoung Dec 5, 2024

jstarry Nov 20, 2024

jstarry Nov 20, 2024

Lichtso Nov 20, 2024

jstarry Nov 20, 2024

Lichtso Nov 21, 2024

jstarry Nov 21, 2024

tao-stones Nov 22, 2024

topointon-jump Nov 22, 2024

tao-stones Nov 22, 2024

0x0ece Dec 7, 2024

topointon-jump left a comment

topointon-jump Nov 22, 2024

ravyu-jump left a comment

t-nelson commented Dec 11, 2024

bw-solana commented Dec 11, 2024

Benhawkins18 left a comment


		## Detailed Design

		If VM execution returns any error except `SyscallError`, transaction's CU meter

SIMD-0182: Consume requested CUs for sBPF failures #182

SIMD-0182: Consume requested CUs for sBPF failures #182

Conversation

tao-stones commented Oct 4, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

topointon-jump Nov 3, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

topointon-jump left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ravyu-jump left a comment

Choose a reason for hiding this comment

t-nelson commented Dec 11, 2024

bw-solana commented Dec 11, 2024

Benhawkins18 left a comment

Choose a reason for hiding this comment

topointon-jump Nov 3, 2024 •

edited

Loading